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BEST  RATIONAL  APPROXIMATIONS  WITH  NEGATIVE  POLES  TO  e”X  ON  [o,“) 
E.H.  Kaufman,  Jr.  and  G.D.  Taylor  . 

In  this  paper  a theory  for  approximating  e X on  fo,®)  with 
rational  functions  having  negative  poles  is  developed.  Numerical 
results  suggest  that  the  best  uniform  approximation  to  e x on 
Co,® ) from  this  class  has  only  one  pole  and  this  is  shown  to  be 
the  case  when  using  rational  functions  of  this  form  which  are 
linear  polynomials  divided  by  quadratic  polynomials.  Numerical 
results  are  given  and  compared  to  recent  results  of  Saff,  Sch’oa- 
hagc  and  Varga. 


1 Introduction 


Let  it  denote  the  space  of  all  real  algebraic  polynomials  of 
w 

degree  less  than  or  equal  to  m.  For  each  m = 1,2,...,  define  R 

m 

by 

m 

R * {R=P/Q:  PER  , Q(x)=  II  (q.x+l),  q..O  for  all  i}. 

® m- 1 . , l 1-' 

1=1 


Thus,  R is  the  collection  of  all  rational  functions  with  negative 


poles  from  R"1  *Co.°°).  Define  X by 


(1.1)  1.-  Lf{||.  - *1  1 R„>.  • 

It  is  known  that  X converges  geometrically  to  zero  (i.e.  lim 
1 /in  m-*™ 

Xm  ■ 0)  since  Saff,  Schonhage  anti  Varga  [5^  have  proved  that 

there  exists  a sequence  { R } with  R (x)  * P ,(x)/(l*  — )m, 

m m*l  m m-l  m 

P . £ , such  that 

m-l  m-l 


3-2  *^T  _<  i ltn  J | e 


-x 


lll/m  l 

Rm  L^fo,®)  - 2 * 


In  addition,  since  the  poles  of  k (x)  are  all  real  it  follows  that 

01 

Rb(z)  must  converge  geometrically  to  e”Z  in  an  infinite  sector 
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In  this  paper  a theory  for  approximating  e x on  ^0 , »)/  with  rational 
functions  having  negative  poles  is  developed . Numeric** results  suggest  * 
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In  what  follows,  we 


symnetric  about  the  positive  x-axis  Cel 
define 


(1.2)  U ■ inf { | |e 


"x p(x)  I I oor 

(1+  x)m  L Lo,°°) 


P e Vl*- 

m- 1 


An  application  of  this  theory  is  in  the  construction  of  num- 
erical algorithms  for  solving  linear  systems  of  ordinary  differ- 
ential equations  which  arise  from  semi-discretization  of  linear 
parabolic  partial  differential  equations  (see  [0.  [5]  ) . Num- 
erically, this  reduces  to  an  iteration  of  the  form 

(1.3)  w(r)  - A_1k  + R (AtA){w(r_1)  - A_1kJ 

a*  a*  m a*  *» 

where  A is  a nxn  matrix  (band),  k and  w^^  are  n dimensional 

vectors  (n  is  related  to  the  stepsize  of  the  discretization) , At 

is  a scalar  and  R is  the  rational  function  defined  above.  Due 
on  ( r ) 

to  the  special  form  of  the  denominator  of  R , w can  be  obtained 

At  m ** 

from  the  repeated  inversion  of  (I  + — A)^+^  = 0 _<  t < m - 1 

using  an  appropriately  defined  e . This  is  an  attractive  method 

. ^ . At 

numerically,  since  an  LU  factorization  can  be  done  for  I + — A 
only  once  and  this  factorization  will  preserve  any  band  structure 
that  is  present. 

One  can  construct  a similar  numerical  method  using  a solution 
* * m 

R (x)  * P ,(x)/  II  (q.x+l)  to  (1.1).  The  apparent  disadvantage 

® O” 1 . _ | 1 ^ j 

of  such  a method  compared  to  that  of  is  that  w is  found 


from 


{ n (I+q.AtA)}w(r)-{  n (I+q.AtA)}A~lk+P*  (At A) {w(r_l >-A_1k} 
£■!  1 **  i-i  1 •*  «-l 

which  will  involve  a greater  number  of  operations  (though,  less 

than  m LU  factorization  and  2m  substitutions).  The  advantage  of 

this  method  is  that  \ will  be  smaller  than  p giving  increased 

“ m * 
accuracy.  However,  it  appears  (numerically)  that  actually 


•CCZiSICi  :;r 

1TIS 
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•MMOWCrj 
J> miFicuicii 


Se.liao 


ir 


f B“w«tfn«i/4if4fut»urr  cojk 

fait  ffltf  or  SmSSoT 
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3 


has  q,  ■ q ■ . . . = q and  X is  approximately  one  half  of  V 

l * m ^ P m 

for  m > 2.  Thus,  using  R gives  a method  that  has  the  same 

ID 

desirable  properties  as  that  of  (1.3)  and  increased  accuracy. 

In  the  next  section  we  shall  state  some  general  facts  con- 
cerning uniform  approximation  from  R^  (these  results  will  appear 
in  a future  paper  £3}),  give  a theoretical  treatmeht  of  best 
uniform  approximation  of  (l.l)  for  the  special  case  that  m * 2 
and  state  some  conjectures.  In  the  last  section  we  will  discuss 
our  algorithm  and  present  some  numerical  results. 


2 Theoretical  Results 


In  this  section  we  begin  by  giving  a existence  theorem  for 

approximating  from  R on  This  result  is  valid  for  a large 

in 

class  of  functions  (containing  e ).  We  shall  outline  a proof  for 
the  special  case  that  m ■ 2. 


THEOREM  2.1.  There  exists  R 

* X . 

ra 


e Ra  for  ahicb  | |e"x-R  (x)  I |l«£0  od) 


that  qln  ^ 0,  q2n  > 0 for  all  n and  >_  | |e  x 


THEOREM  2.2.  There  exists  R e Rj  for  which  | |e  -R  (x)  | Il“£q  ^ 

' V , 

Proof.  The  proof  begins  by  first  observing  that  X^  < — . Thus, 

let  ft>n},  tq  | n } and  {q2n  } (n°l,2,...)  be  sequences  such 

' a x+b 

n n | | 

(<»lnX+1)(<l2nX+l) 

I ^ as  n -►<*>,  where  we  will  no  longer  write  the  subscript  L £o,°°) 
on  the  norm  bars.  Next,  the  proof  is  divided  into  two  cases.  The 
first  case  is  when  the  sequences  {q^}  and  (q,^  } are  bounded.  In 
this  case  it  follows  that  the  sequences  {a^  } and  {b^  } are  also 
bounded  and  the  desired  result  follows  as  in  the  standard  rational 
approximation  theory.  '• 

Thus,  let  us  assume  that  the  sequences  {q^}  and  {q2n)  are 
not  both  bounded.  By  relabelling  and  extracting  subsequences  we 


uy  assume  that  <llnf  * *nd  q,n  i q2n  for  *11  *»•  In  addition,  by 


‘In 


2n 


1 


looking  at  the  error  curve  at  x ■ 0 and  x ■ -,  respectively,  we 

1 1 
have  that  -r  < b < 
z " n - 

(say  n nfl)  that 


1 3 

have  that  •?  < b < - for  all  n,  and  for  all  n sufficiently  large 

2 ~ n **  z 


(2.D  o < n < 


1 

2 *n 


" 4 


< M 


where  n and  M are  positive  constants  independent  of  n. 

Next,  we  claim  that  the  sequence  t^n^  mu8t  bounded.  In- 
deed, if  not  then  by  passing  to  subsequences  (and  relabelling)  we 
■ay  assume  that  qJn  + * as  n + ».  Let  x^  1 

t C0,«).  Note  that  x ^ 0 and  that 


ax  ♦ b 
n n n 


a x 

n n 


(q,  x ♦l)(q_  x + l)  r-  (q.  x *l)(q,  x +1) 
sln  n ’2n  n un  n ^2n  n 


' (,,  I i T Th“*-  ‘V  k' 


as  n + For  n sufficiently  large  this  contradicts  our  assump- 

a x ♦ b 

— X “ 

tion  that  I |e 

",ln“'*,",2n"' 

bounded.  Since  a -*•<*>  by  (2.1),  we  have,  reciprocating  (2.1), 
n 

that  there  exists  positive  constants  c^  and  c^  independent  of  n 

such  that  for  n > n.  > n. 

— 1 ■—  o 


(2.2) 


i V1 


< —is  <_ x_. 


By  (2.2)  we  may  extract  a subsequence  (and  relabel)  for  which 

-*•  q*  > 0 and  q,  /a  + c*  > 0 as  n + «.  Hence,  for  fixed 
1 — Mn  n 


*2n 


x e (0,»)  we  have  that 
a^x  ♦ b 
'In "*20 


n n » 1 

(q. .x+l)(q«  x+1)  c*(q*x+l ) 


By  continuity,  | |e  X - 1 1 <:  X ^ completing  the  argument.  | 


b* 


•q*x*i 


I 


5 


The  above  proof  (suitably  modified)  also  establishes  the 

/<•» 

following  corollary  where  R^  “ {R-P/Q:  P e V Q(x)=(qx+l)  , 
q > 0}. 

COROLLARY  2.3.  There  exists  R e R^  such  that  | |e  *-r| | = 
inf { | |e  X-R| | : R e R^} . . 


Next,  we  wish  to  turn  to  proving  that  for  the  m=2  case  the 
best  approximation  from  R2  is  actually  contained  in  R^.  To  do 
this  we  shall  first  show  that  neither  of  the  coefficients  in  the 
denominator  is  zero  and  that  the  numerator  and  denominator  do  not 
have  a common  non-constant  factor. 


THEOREM  2.4.  The  best  approximation  £0  e X from  R^  is:  not  of  the 

, ax+b  . 

form  r,  q > 0. 

qx+1  — 

Proof.  To  prove  this  we  use  some  computed  results.  First  of  all, 
running  the  Remes-Difcor  algorithm  as  described  in  [2].  we  found 

the  "best"  approximation  of  the  form  /ax+tj/ft ,x+q  } (wi  th  |q  | < 1, 

11  -x  „ . i .500  ' ! 1 * 

|q2|  < 1)  to  e on  X = ^IS  routine  returned  the 

values:  a --.0934430154,  b * .6698426328,  qL  « 1.0  and  q2  = 

.6330537047.  It  also  returned  four  extreme  points  x.  - 0.0,  x~  = 

“X ; ^ ^ 

.44,  Xj  * 2.76  and  x^  - 20.0  such  that  e 1 - ^ax.+b/q^ x>q2  J- 
(-l)Ie^  with  > .058  for  all  i*l,2,3,4.  Thus,  by  a de  La  Vallee 
Poussin  type  argument  we  have  that  inf(||c  X— r- 1 ! ^ °°^q 

r e R J £0 ,»-) > > .058.  Next,  setting  r*(x)  “ (a*x+b*)/(p*x+l ) with 
a*  - -.1853243706,  b*  - 1.022709327  and  p*  - .524169575  we  cal- 


culated y *■  max{|c  X-r*(x)|:  x c i/1000  for  0 £ i ^ 20,000}  and 
found  that  y < .023.  Next,  by  dividing  [o,2o)  into  [o,z}  and 
[z,2o]  where  z is  the  zero  of  r*(x)  we  are  able  to  show  that 
| E * (x)  | <_  3.2  on  CP,20)  where  E(x)  ” e X-r*(x).  Using  this  and 
the  above  value  of  Y with  Taylor's  theorem  for  linear  polynomials 
we  can  show  that  |E(x)|  < .0246  on  £o,2o}.  Since  E(x)  > 0 and 
E'(x)  < 0 for  x > 20,  we  have  that  |E(x)|  < E(20)  < .022  for 


i 


x >_  20.  This  completes  the  proof. 

Note  that  this  proof  also  shows  that  r*  is  a better  approxi- 
mation than  the  one  calculated  in  C$}  for  m-2.  Next,  we  turn  to 
proving  that  for  any  best  approximation  in  the  m=2  case,  the 
coefficients  in  the  denominator  coalesce;  that  * q^. 


THEOREM  2.5.  Any  best  approximation  to  e from  R?  on  [o,°°)  be- 

— . *2 

longs  to  is , it  i^  of  the  form /ax+ty/ ( qx+ 1 ) with  q > 0. 

Furthermore , q > 0,  and  the  numerator  and  denominator  have  no 
non-constant  common  factors . 

Proof.  The  facts  that  q > 0 and  the  numerator  and  denominator 
have  no  non-constant  common  factors  follows  from  Theorem  2.4. 

Let  R(x)  = (pj+p^x^lq^x+lHq^x+l)  be  a best  approximation  to  e 
on  CO,-)  from  with  0 < q^  < q2>  We  first  claim  that  e X - R(x) 
has  at  least  5 alternating  extreme  points  in  Co,a]  where  a is 
chosen  such  that  x ^ a implies  |e  X-R(x)|  ^ — | |e  X-R(x)| | l°°£q  = 


■jX..  This  follows  from  the  fact  that  R e ^Co.ctJ  and  has  deflect 
zero,  since  if  e -R(x)  had  fewer  than  5 alternating  extreme 
points,  then  the  standard  argument  to  prove  alternation  in  R [o,a) 
can  be  used  to  find  R(x)*(a+bx)/( 1+cx+dx^)  t R ^ ^0 1 oQ  such  that 
| |e"X-R(x)|  |L»^0>aj  < | |e'X-R(x)|  |L»£0  >a-J  and  with  | Px~a  | , |p2- 


,-b 


l,l*,2“Cl  and  lqlq2~dl 


as  small  as  desired. 


Thus,  we  can  guaran- 


tee that  | | e X-R(x) | lL~fy  ^ < X2  holds  and  that  R(x)  also  has 

2 

unequal  negative  roots  (the  discriminant  of  1+cx+dx  can  be  made 

2 

arbitrarily  close  to  that  of  l+(q^+q2 )x+q^q  x ).  This,  of  course, 

x 

is  a contradiction  showing  that  e -R(x)  must  have  5 alternating 
extreme  points  on  [o, a]. 

Thus,  R(x)  is  the  best  approximation  to  e * on  Eo.ai]  from 
J?2  by  the  classical  alternation  theorem  and  also,  therefore  on 
Co,—).  Thus,  we  shall  complete  this  proof  by  showing  that  the 


“X  i r 

best  approximation  to  e from  ^lO,00)  does  not  have  real  poles. 

To  do  this,  we  computed  the  "best  approximation",  Rfx)  = 

(a+bx)/( l+cx+dx^)  to  e X from  R^fo^oJ  on  a 200,001  point  equally 

spaced  grid  imposed  on  fo,2o]  . The  computed  results  (rounded  to 

10  decimal  places)  were  a = .9911236330,  b = -.1577830783,  c = 

.6706780400,  d * .6494291043;  the  extreme  points  were  y^=0,  y = 

.2483,  y^=1.0852,  y^=3.2271  and  y^=13.1518.  The  absolute  errors 

at  the  extreme  points  were  .0088763670  (they  actually  differed  by 
-18  -x 

less  than  5x10  ) and  the  sign  of  e -R(x)  was  positive  at  y^. 

The  discrimant  of  the  denominator  was  -2.1481756150.  By  direct 
calculation,  it  can  be  easily  seen  that  E(x)  = c X-R(x)  > 0 and 
E'(x)  < 0 for  x > 20.  Thus,  | | E(x)  | | L~^0  oo)  = I I E(x)  | | L°°J0> 2pJ  ' 

Now,  let  us  assume  that  there  exists  R e having  negative  poles 
and  for  which  ||e  X-R(x)||  < ||E(x)||  holds,  where  for  the  re- 
mainder of  this  proof  ||*||  = 11*11  L°°[o  2oJ  ‘ T*1^s  lead  to  a 

contradiction  and  give  our  desired  result.  We  begin  by  noting 
that  | E ' ( x ) | < 1 for  all  x e fo,2oJ  since  -e  X and  -R'(x)  have 
opposite  signs  for  x e £o,  -a/bj  and  |R'(x)|  < 1 for  all  x since 
the  denominator  is  increasing  faster  than  the  absolute  value  of 
the  numerator  for  all  x.  For  x C [~a/b  ,2oJ  simply  look  at  the 

ratio  of  the  maximum  of  the  numerator  on  this  interval  and  the 

oJ- 

r value  of  the  denoininator^-a/b . Thus,  by  the  mean  value  theorem 

we  have  that  for  each  x E ^0,20],  |E(x)-E(x)|  < .00005  where  x de- 
notes a closest  grid  point  to  x.  Let  6 = .000054,  then  ||e||  - 
min{|E(y^)|:  i=l,...,5}  < 6 since  | E(y^)-E(y .)  | < .000002  for  i, 

j ■ 1 5.  Since  we  are  assuming  that  | |t-x-R(x) | | < ||e(x)||, 

\ — we  must  have  that  | |c  X-R(x)||  - min{|E(y.)|:  i*  1,...,5}  < 6. 

Now,  there  must  exist  iQ,  1 <_  iQ  <.  5 such  that  (~l)X°(R(y.  ) 

- • i 0 

- R(y.  ))  > 0 since  R f R,  Let  us  assume  that  max{(-l)  (R(y.)  - 

o 

R(y.)):  i » 1 5)  * (-1 )^(R(yj)-R(y^) ) . Next,  find  R*(x)  * 


(a*Aa+(b+-Ab)x)/( l+-(c+Ac)x+(d+-Ad)x^)  such  that  R*(y^)  * R(y^)  ♦ 
(-l)l<5  for  i = 1,...,4.  To  do  this  we  must  solve  the  linear 
system  Aa+Aby ^~Acy^( R(y^)+(-l) l6 )-Ady?(R(y ^ ) + (-l ) 1{)  = (l+cy^+dy?) 

(-l)lfi,  i“l,...,4.  Solving  this  with  Cramer's  rule,  with  the 
determinants  computed  by  cofactor  expansion  to  avojd  error  magni- 
fication by  divisions,  gives  Aa  = -.0000540000,  Ab  * .0004710533, 
Ac  ■ -.0005063974,  Ad  * .0019944507.  Using  this  R*,  we  have  that 
R(y5)  - R*(y$)  = -.0000746489  and  | | R(y.)-R*(y.) | - 6|  < 2 x 10_l8 
for  i * 1,...,4.  The  discrimant  of  R*  was  -2.156832218.  Now,  by 

construction,  C — 1 ) X ( R( y ^ ) ~^R*(y^) ) < 0,  i=l 4,  and  also,  for 

i * 1,...,5  we  must  have  ( — 1 ) 1 ( R( y . ) ) < 6,  since  ||e  X— K( x ) | | < 

^ c __ 

min{  | E(y^)  | : i*=l, . . . ,5}  + 6.  Now,  suppose  (-1)  ( R(y^)-R*(y^)  )>0 
(for,  if  not,  then  R = R*  and  we  have  our  desired  contradiction 
as  R*  has  non  real  poles).  Then,  we  have  that  6*  + R(y^)  = R*(y^) 
> R(y^)  and  that  6*  = (-l)^(R(y^)-R(y^))  ^ (-1 ) *( R(y^)-R(y^) ) , 

i * 1,...,5  so  that  |R(y^)-K(y^) | £ 6*,  i = 1,...,5.  Letting 
R(y^)  - R(y^)  = 6^,  i = 1,...,4,  we  can  estimate  the  coefficients 

of  R from  the  equations  R(y^)  = R(y^)  + 5^,  i = 1 4,  where  we 

know  that  |<5^|  <_  6*.  Writing  R(x)  = (a+Aa+(b+Ab)x)/ ( l+(c+Ac ) + 
(d+Ad)x^),  this  system  is  equivalent  to 

Aa+Aby  .-Acy . (R(y . )+6 . )~Ad.y?(R(y . )+6  • ) = ( 1+cy ,+dy?)6 . , 

■'i  ' 1 1 X X 11  1 1 111 

i * 1 A. 

Once  again  we  resort  to  Cramer's  rule  to  estimate  Ac  and  Ad. 
Writing  the  determinant  of  the  coefficients  of  this  system  as  the 
sum  of  four  determinants,  one  of  which  had  no  <5^ ' s in  it  (say  D), 
we  then  computed  D (=  .1194079538)  and  estimated  upper  bounds  for 
the  three  remaining  determinants,  subtracting  these  values  from  D 
showed  that  the  determinant  of  the  coefficients  > .1192288500. 
Calculating  upper  bounds  for  the  numerator  determinants  in  the 
formulas  for  Ac  and  Ad  and  then  estimating  gives  |Ac (<. 0012430121, 


|Ad|  < .0027602264.  Thus,  letting  and  denote  the  discrimi- 
nant of  the  denominators  of  R and  R,  respectively,  we  have  that 
|D^  - 5^ | < .0127092755.  Treating  the  other  cases  where  the 
maximum  of  (-1 ) 1 ( R(y.. )-R(y ^ ) ) occurs  for  i = 1,2,3,  or  4,  simil- 
arly, we  found  that  |D^-D^|  <_  .3959944800,  .1553098052, 
.0673900941  and  .0219551497,  respectively. 

We  conjecture  that  this  result  is  true  for  all  m ^ 2 . We 
close  this  section  by  stating  a local  characterization  and  local 
uniqueness  result  which  will  be  proved  in  a forthcoming  paperC3 ]. 

Definition  2.6.  R(x)  * (p,+...+p  xm  ^)/(qx+l)m  is  a local 

1 m m — — 

best  approximation  to  e X on  fo , °°)  i^  there  exists  a 6 > 0 such 
that  if  R(x)  = (p1+.  . -+Pmxin  *)/(qx+l)m  e Rm,  |p^-p^|  < 6, 

i*l, . . . ,m  and  | q— q | < 6 then  | |e  X-R(x)  | | | |e  X-R(x)  | | . I_f , in 

addition,  strict  inequality  hoHs  whenever  R(x)  t R(x)  then  R is 
said  to  be  locally  unique . 


THEOREM  2.7.  Let  m > 1.  Then  a nondegenerate  R(x)  = P(x)/Q(x)  *= 

(p,  + ...+p  xm  ^ )/ ( qx+1  )m  e R (i.e.  R t o,  P(x)  and  Q(x)  have  .no 
1 m m 

“X 

common  factors  and  q > 0)  ig  jj  best  local  approximation  to  e 

a.  _ _x 

from  Rm  on  JO, 00 ) it  and  only  if  e - R(x)  has  at  least  m + 2 

alternating  extreme  points . Whenever  this  occurs , R j_s  locally 

unique . 

We  remark  that  numerical  examples  seem  to  suggest  that  there 
exist  distinct  Rp  R£  £ satisfying  this  theorem. 


3 Numerical  Results 


Our  initial  algorithm  for  computing  approximations  to  e 

A* 

from  R and  R involved  linearizing  the  denominator  by  Taylor's 
m m 

Theorem  and  setting  up  an  iterative  procedure,  using  the  differ 

ential  correction  algorithm  to  compute  an  approximation  at  each 

m 


inner  stage.  Precisely,  for  R set  g(q  ,...,q  ,x) 

m 1 m 


n 

i“ 


iiq.x+l) 


ui 

and  define  ,x)  * x.H  (q.x+1)  for 

j ^1  i=li 

Vql,***'q«,x)  " 8<ql,,,'*VX>  " t qvVql"">qm,x)'  Thus  ’ 

m V-l 

if  R(x)  = P(x)/.n  (q.x+1),  0 < q.  < q.  < ...  < q is  an  approxi- 
1*1  — 1 — 2 — — ^m 

« “X 

mation  to  e at  some  step  in  the  algorithm,  then  a new  approxi- 

m_ } m 

mation  R(x)  = (p  +p,x+...+p  ,x  )/. II  (q.x+1)  is  'found  by  cal- 
ro  rl  rra-l  i=l  l 

culating  p ,...,p  ,q.,...,q  that  minimize  | |e  -(p  +...+p 

o m- 1 i m o oi”  I 

* • • ’ V*)  + * -+qA(q"l-  • * ,qm’  x)+^o^ql  ’ * * ' ’ VX))  1 1 

over  T a finite  subset  of  [o,n]  . Observe  that  the  denominator  in 
this  problem  is  precisely  the  linearization  of  g(q^ , . . . ,q  , x)  via 
Taylor's  Theorem  applied  to  the  first  m independent  variables. 
This  minimum  can  be  calculated  by  the  differential  correction 
algorithm.  Since  this  is  a linearization  of  the  problem  we  wish 
to  solve,  if  we  force  an  ordering  on  the  q^,...,q.^  to  get  a 
unique  solution,  it  seems  reasonable  to  expect  that  if  the  init- 
ial approximation  is  sufficiently  close  to  a best  approximation 
then  this  algorithm  will  converge  to  that  best  approximation. 

This  approximation  must  be  calculated  on  a large  interval 
(the  length  of  the  interval  needed  seems  to  increase  as  a func- 
tion of  m,  but  not  monotonical ly ) to  give  a candidate  for  a best 

— x 9 

approximation  to  e on  10,°°);  and  since  we  wish  to  get  an 
accurate  approximation  of  the  continuous  solution,  we  must  use  a 
fairly  fine  mesh  so  that  card  (T)  will  be  large.  Since  the  diff- 
erential correction  algorithm  tends  to  become  unstable  as  card 
(T)  grows  large,  we  decided  to  use  the  Reraes-Di f cor  algorithm 
M for  calculating  the  linearized  minimum.  We  did  this  because 
this  algorithm  applies  the  differential  correction  algorithm  to 
certain  (small)  subsets  of  T chosen  in  such  a manner  (depending 
upon  alternation)  that  convergence  to’ the  solution  on  the  whole 
space  occurs.  Thus,  we  had  no  a priori  guarantee  that  this  would 
work  since  a standard  alternation  theory  does  not  exist  for  the 


linearized  minimization  problem  due  to  the  addition  of  the  con- 
straints on  { q . }™  , . However,  in  spite  of  this,  the  results  of 
n l-l 

the  algorithm  are  acceptable  in  that  the  algorithm  returned  (or 

tried  to  return)  a solution  in  which  the  q.'s  coalesced  and  for 

— m j-jj 

which  the  error  curve  e - P(x)/^n^(qx+l)  (the  final  coalesced 
approximation)  alternated  on  m+2  points  of  T.  Thus,  by  an  alter- 
nation theorem  we  have  proved  DJ,  we  have  a best  local  approxi- 

+*0 

mation  from  R and  as  we  conjectured  earlier;  therefore,  also 
m 

from  R ; we  also  ran  an  algorithm  of  this  character  for  the  class 
R . In  all  cases  it  has  given  the  same  results  as  the  above 

ID 

algorithm  applied  to  R . A precise  study  of  these  algorithms 

remains  to  be  done  and  we  conjecture  that  convergence  results 

can  be  proved  for  both  R and  R , at  least  using  the  differential 

mm 

correction  algorithm  for  the  inner  minimization. 

We  have  run  these  algorithms  for  various  values  of  m using 

a grid  with  spacing  .002  imposed  on  an  interval  [o,n],  where 

N is  chosen  by  trial  and  error  so  that  the  computed  results  make 

it  apparent  that  the  error  norm  on  fN,°°)  is  smaller  than  the 

error  on  M-  The  computations  were  done  on  a UNI VAC  1106, 

which  has  roughly  18  digits  of  accuracy  in  double  precision. 

Initially,  we  start  with  PQ=1.  p^= . . •*'Pm_1=0 » qj  = j=l,...,m 

and  ran  the  program  with  additional  constraint? q^  <_  q^+j  ~ DIFF 

where  DIFF  is  a nonnegative  parameter.  If  DIFF  > 0 we  found 

that  the  computed  q^’s  immediately  differed  by  exactly  DIFF,  and 

if  DIFF  was  set  equal  to  0 the  algorithm  ran  and  the  computed 

q^'s  coalesced.  The  algorithm  for  R^  had  a linearization  in 

-9  which  the  denominator  of  the  approximation  is  g*m  x(qx+l)m  * ♦ 

£(l-m)qx+j)  (qx+l)m  * = q<|^(q,x)  + ^(q.x)  where  q is  the  value 

from  the  previous  approximation.  Here  the  initialization  was 

P “ 1,  p = ...  - p = 0,  q = 1/n.  Although  we  allowed  this 
o 1 m- 1 

program  to  run  for  seven  outer  iterations,  the  coefficients  near- 
ly always  stopped  changing  after  four  or  five  outer  iterations, 


and  the  computed  absolute  values  of  the  errors  at  the  m + 2 
extreme  points  agreed  to  at  least  fourteen  significant  figures. 
The  results  are  shown  in  the  table  below,  with  the  error  of  l 5j 
given  in  the  last  column  for  comparison  purposes.  The  sign 
attached  to  the  last  extreme  point  is  the  sign  of  E(x)  at  that 
point.  It  should  be  noted  that  it  is  possible  (although  unlike- 
ly) that  in  some  cases  there  is  a local  best  approximation  other 
than  ours  which  gives  a smaller  error.  In  the  m = 3 case  we 
have  found  another  local  best  approximation  (with  g = 1.05109 
and  | | error | | = 1.33720  (-02)  and  in  the  m = 6 case  there  appear 
to  be  at  least  three  local  best  approximations  other  than  the  one 
in  the  table. 

Finally,  we  would  like  to  thank  Professor  R.S.  Varga  for 
bringing  [aJ  to  our  attention  where  some  of  the  results  of  this 
paper  and  of  f3j  have  also  been  obtained  independently. 

Table  of  Numerical  Results 


m 

last  ext.  pt. 

q 

| |error 

II 

| | error | | 

1 [5  J 

2 

12.932+ 

.52416 

2.27093 

(-02) 

2.49038 

(-02) 

3 

37.250- 

.27127 

8.04713 

(-03) 

1.5053 

(-02) 

4 

83.814+ 

.17797 

3.30771 

(-03) 

7.85325 

(-03) 

5 

80.802+ 

.27866 

1.16064 

(-03) 

3.05486 

(-03) 

6 

152.352- 

.19296 

4.26252 

(-04) 

8.89316 

(-04) 
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